Corpus: bel_news_2020_100K

Other corpora

4.4.1.5 Number of Word-N-grams at Sentence Endings

Number of word-N-grams for N=1...5 for the first K sentences

K # of words # of bigrams # of trigrams # of 4-grams # of 5-grams
100 90 92 95 95 96
1000 609 705 785 918 945
10000 6061 8336 9122 9591 9761
100000 36192 71798 87716 94434 97447
1000000 36192 71799 87717 94435 97448


Zipf's diagram for sentence endings


Gnuplot diagram

9448 msec needed at 2021-03-30 16:05